Lexical stress detection for L2 English speech using deep belief networks

نویسندگان

  • Kun Li
  • Xiaojun Qian
  • Shiyin Kang
  • Helen M. Meng
چکیده

This paper investigates lexical stress detection for L2 English speech using Deep Belief Networks (DBNs). The features of the DBN used in this work include the syllable-based prosodic features (assumed to have Gaussian distribution) and their expected lexical stress (assumed to have Bernoulli distribution). As stressed syllables are more prominent than their neighbors, the two preceding and two following syllables are taken into consideration. Experimental results show that the DBN achieves an accuracy of about 80% in syllable stress classification (primary/secondary/no stress) for words with three or more syllables. It outperforms the conventional Gaussian Mixture Model and our previous Prominence Model by an absolute accuracy of about 8% and 4%, respectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic lexical stress and pitch accent detection for L2 English speech using multi-distribution deep neural networks

This paper investigates the use of multi-distribution deep neural networks (MD-DNNs) for automatic lexical stress detection and pitch accent detection, which are useful for suprasegmental mispronunciation detection and diagnosis in second-language (L2) English speech. The features used in this paper cover syllable-based prosodic features (including maximum syllable loudness, syllable nucleus du...

متن کامل

Automatic Classification of Lexical Stress in English and Arabic Languages Using Deep Learning

Prosodic features are important for the intelligibility and proficiency of stress-timed languages such as English and Arabic. Producing the appropriate lexical stress is challenging for second language (L2) learners, in particular, those whose first language (L1) is a syllable-timed language such as Spanish, French, etc. In this paper we introduce a method for automatic classification of lexica...

متن کامل

Integrating acoustic and state-transition models for free phone recognition in L2 English speech using multi-distribution deep neural networks

This paper investigates the use of Multi-Distribution Deep Neural Networks (MD-DNNs) for integrating acoustic and statetransition models in free phone recognition of L2 English speech. In Computer-Aided Pronunciation Training (CAPT) system, free phone recognition for L2 English speech is the key model of Mispronunciation Detection and Diagnosis (MDD) in the cases of allowing freely speaking. A ...

متن کامل

Prosodic Differences between Taiwanese L2 and North American L1 speakers— Under-differentiation of Lexical Stress

Assuming that categorical differentiation is major acoustic characteristics of English lexical stress through binary instead of more complex 3-way distinction, we investigated lexical stress in broad and narrow focus positions and found how binary distinction is achieved by the concomitancy of secondary stress defined by its position and distance in relation to primary stress. Similar results a...

متن کامل

English Lexical Stress and Spoken Word Recognition in Korean Learners of English

Two experiments explore how Korean-speaking L2 learners of English process English lexical stress during spoken word recognition. Korean doesn't employ lexical-level prosodic distinctions like English lexical stress, but it has phrase-level prosodic structure ((T) HLH), with the initial tone determined by the phonation type of phrase-initial sound. Results from eye-tracking and gating experimen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013